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Abstract 

o 

(N 

Motivated by a host of recent applications requiring some amount of redundancy, frames are becoming 
a standard tool in the signal processing toolbox. In this paper, we study a specific class of frames, known 
^sO ' as discrete Fourier transform (DFT) codes, and introduce the notion of systematic frames for this class. 

This is encouraged by a new application of frames, namely, distributed source coding that uses DFT 
codes for compression. Studying their extreme eigenvalues, we show that, unlike DFT frames, systematic 
DFT frames are not necessarily tight. Then, we come up with conditions for which these frames can be 



tight. In either case, the best and worst systematic frames are established in the minimum mean-squared 
reconstruction error sense. Eigenvalues of DFT frames and their subframes play a pivotal role in this 
work. Particularly, we derive some bounds on the extreme eigenvalues DFT subframes which are used 
to prove most of the results; these bounds are valuable independently. 

' Index Terms 

o 

BCH-DFT codes, frames, systematic frames, parity, eigenvalue, optimal reconstruction, quantization, 
erasures, distributed source coding, Vandermonde matrix. 



I. Introduction 

Frames, "redundant" set of vectors used for signal representation, are increasingly found in signal 
processing applications. Frames are more general than bases as frames are complete but not necessarily 
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linearly independent. A basis, on the contrary, is a set of vectors used to "uniquely" represent a vector 
as a linear combination of basis elements. What would be the benefit of representing a signal with 
overcomplete set of vectors? Frames are generally motivated by applications requiring some level of 
redundancy. They offer flexibility in design and have a variety of applications. Frames show resilience 
to additive noise (including quantization error), robustness to erasure (loss), and numerical stability of 
reconstruction. With increasing applications, frames are becoming more prevalent in signal processing. 

In this paper, we study a specific class of frames known as discrete Fourier transform (DFT) codes. By 
using these codes, the ideas of coding theory are described in the signal processing setting. We consider 
the Bose-Chaudhuri-Hocquenghem (BCH) codes, an important class of multiple-error-correcting codes, 
in the DFT domain Q-JH. BCH-DFT codes are cyclic codes in the complex (or real) domain, similar 
to BCH codes in the binary error correction setting. Therefore, their codewords have certain successive 
spectral components equal to zero. This property is then exploited for error detection and correction in 
the complex (real) field 0-19). 

From Frame theory perspective, DFT codes are harmonic tight frames. In the absence of erasure, 
tight frames minimize the mean-squared error (MSE) between the transmitted and received signals ifTUl - 
|[T2ll . The MSE is the ultimate measure of performance in many digital communication systems where 
quantized analog signal is transmitted. Frames are naturally robust to transmission loss since they provide 
an overcomplete expansion of signal |[T0l - |[T4l . 

DFT frames have recently been applied in the context of distributed source coding (DSC) [15]. More 
precisely, BCH-DFT codes are used for compression of analog signals with side information available at 
the decoder. In DSC context, side information is viewed as corrupted version of signal, and compression 
is achieved by sending only redundant information, in the form of parity or syndrome, with respect to a 
channel code ifToll . Unlike in DSC that uses binary channel codes for compression, in the new framework 
(DSC based on BCH-DFT codes) compression is performed before quantization. As a result, DFT frames, 
which are primarily used for compression, can decrease quantization error at the same time. This results 
in a better reconstruction, in the MSE sense, particularly when the sources are highly correlated. 

Motivated by its application in parity-based DSC that uses DFT codes lfT5l . we introduce the notion 
of systematic frames, in this work. For an (n, k) frame, a systematic frame is defined to be a frame that 
includes the identity matrix of size k as a subframe. Since tight frames minimize reconstruction error 
|[T0l - |[T3l . we explore systematic tight DFT frames. Although it is straightforward to construct systematic 
DFT frames, we prove that systematic "tight" DFT frames exist only for specific DFT codes. More 
precisely, we show that a systematic frame is tight if and only if data (systematic) samples are circularly 
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equally spaced, in the codewords generated by that frame. When such a frame does not exist, we will 
be looking for systematic DFT frames with the "best" performance, from the minimum mean-squared 
reconstruction error sense. We also demonstrate which systematic frames are the "worst" in this sense. 
In addition, we show that circular shift and reversal of the vectors in a DFT frame does not change the 
eigenvalues of the frame operator. We use these properties to categorize different systematic frames of 
an (n, k) DFT frame based on their performance. 

Another main contribution of this paper is to find bounds on the extreme eigenvalues of V H V , where 
V is a square or non-square subframe of a DFT frame. The properties of the eigenvalues of such frames 
are central to establish many of the result in this paper. These bounds are used to determine the conditions 
required for a systematic frame so as to be tight. Besides, eigenvalues are crucial in establishing the best 
and worst systematic frames. 

The paper is organized as follows. In Section [ill we present the basic definitions and a few fundamental 
lemmas that will be used later in the paper. In Section [TTlJ we introduce DFT frames and set the ground 
to study the extreme eigenvalues of their subframes. Section [TV] motivates the work in this paper by 
introducing systematic DFT frames and their application. Some result on the the extreme eigenvalues 
of DFT frames and their subframes are presented in Section [Vj Sections [VI] and IVIII is devoted to the 
evaluation of reconstruction error and classification of systematic frames based on that. We conclude in 
Section IVlTTl 

For notation, we use boldface lower-case letters for vectors, boldface upper-case letters for matrices, 
(.) T for transpose, (.) for conjugate transpose, (.)' for pseudoinverse, (.)* for conjugate, tr(.) for the 
trace, E(.) for the mathematical expectation, and |.| for the Euclidean norm. The dimensions of square 
and rectangular matrices are indicated, respectively, by one and two subscripts when required. 

II. Definitions and preliminaries 

In this section, we introduce the definitions and some basic results which are frequently used in the 
paper. 

Definition 1. A spanning family of n vectors F = {/«}™ =1 in a ^-dimensional complex vector space C k 
is called a. frame if there exist < a < b such that for any x G C k 

n 

a\\x\\ 2 < Y,\(*Ji)\ 2 < H*\\\ (1) 
i=i 

where (as, fi) denotes the inner product of x and /j and gives the ith coefficient for the frame expansion 
of x |[T2l - |[T4l . a and b are called frame bounds; they, respectively, ensure that the vectors span the space, 
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and the basis expansion converges. A frame is tight if a = b. Uniform or equal-norm frames are frames 
with same norm for all elements, i.e., = \\fj\\, for i,j = l,...,n. 

Definition 2. An n x n Vandermonde matrix with unit complex entries is defined by 



W 



1 

6*1 



1 



V 



,j(n-l)6i p j(n-l)d 2 



1 

e j0 n 

e j(n-l)0„ 
2tt , 



\ 



(2) 



in which 9 P 6 [0, 2ir) and 9 P ^ 9 q for p ^ q, 1 < p, q < n. If 9 P = ^-(p — l),W becomes the well-known 
IDFT matrix ifTTl . For this Vandermonde matrix we can write ||T8l , |[T9l 



det(WW H ) = | det(W)\ 2 = — j J 



J9 r , 



J6a\2 



(3) 



l<p<q<n 



Central to this work is the properties of the eigenvalues of V H V or VV H , in which V is a submatrix 
of a DFT matrixO Hence, we recall some bounds on the eigenvalues of Hermitian matrices which are 
used in this paper. Let A be a Hermitian k x k matrix with real eigenvalues {\\(A), . . . , Xk(A)} which 
are collectively called the spectrum of A, and assume Xi(A) > \2{A) > ••• > \k{A). Schur-Horn 
inequalities show to what extent the eigenvalues of a Hermitian matrix constraint its diagonal entries. 



Proposition 1. Schur-Horn inequalities / |20|/ 

Let A be a Hermitian k x k matrix with real eigenvalues Ai(^4) > \2{A) > ■ ■ ■ > A^(^4). Then, for any 
1 < i\ < i% < • • • < ij < k, 

Afc-mO 4 ) H 1" Afe(A) < a ilh H h Oiji, 

<Ai(A) + --- + A/(A), (4) 

w/iere an, • • • j Ofcfc diagonal entries of A. Particularly, for I = 1 and I = k we obtain 

(5) 
(6) 



Xk(A) < a u < Xi(A), 

k k 



i=i 



i=i 



'Note that eigenvalues of V V and VV H are equal for a square V; also, V H V and VV H have the same nonzero eigenvalues 
for a non-square V. 
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Another basic question in linear algebra asks the degree to which the eigenvalues of two Hermitian 
matrices constrain the eigenvalues of their sum. Weyl's theorem gives an answer to this question in the 
following set of inequalities. 

Proposition 2. Weyl inequalities / T20l/ 

Let A and B be two Hermitian kxk matrices with spectrums {\\{A), . . . , Afc(^4)} and {Xi(B), . . . , Xk(B)}, 
respectively. Then, for i, j < k, we have 

X i (A + B)<X j (A) + X M+1 (B) for j < i, (7) 
X i (A + B)>X j (A) + X k+i - j (B) for j>i. (8) 

Corollary 1. If A + B = 7/^, 7 > 0, where A and B are Hermitian matrices, then Xj{A) + Xk-j+i(B) = 
7- 

Proof: It suffice to set i = k and i = 1 respectively in (0 and ([8]), and use Xk{A+B) = X\{A+B) = 
7 which is obtained from A + B = 7/^. ■ 

Lemma 1. Let A and B be two Hermitian kxk matrices and suppose that, for every 1 < i,j < k, 
Aij = e^ ei Bij; then A H A and B H B have the same spectrum. 

Proof: The proof is immediate using Lemma 3 |fT9l since {A H A)ij = ^-(B H B)ij\ i.e., A H A = 
B H B. m 



III. DFT Frames 

A. BCH-DFT Codes 

DFT codes O, are linear block codes over the complex field whose parity-check matrix H is defined 
based on the DFT matrix. A Bose-Chaudhuri-Hocquenghem (BCH) DFT code is a DFT code that insert 
d — 1 cyclically adjacent zeros in the frequency-domain function (Fourier transform) of any codeword 
where d is the designed distance of that code [2]. Real BCH-DFT codes, a subset of complex BCH-DFT 
codes, benefit from a generator matrix with real entries. The generator matrix of an (n, k) real BCH-DFT 
coddflis typically denned by (H, flU, CD, id 

G = J^W*ZW k , (9) 

2 Real BCH-DFT codes do not exist | 3) when n and k are simultaneously even. 
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in which W\ represents the DFT matrix of size I, and S is defined as 

1 1. o\ 



-.x/.- , (10) 

where a = |~n/2] — [(n — k)/2\, [3 = k — a, and the sizes of zero blocks are such that £ is an n x k 
matrix JSJ. One can check that S^S = I*., and T>T> H is an n x n matrix given by 



( I a ^ 



H 



V 





Ip j 



(11) 



The above structure guarantees a real BCH code. Note that, having n — k consecutive zero rows, £ inserts 
n — k consecutive zeros to each codeword in the frequency domain, which is required to make a BCH 
code 

Remark 1. Removing VFfc from © we end up with a complex G, representing a complex BCH-DFT 
code. In such a code, a and /3 can be any nonnegative integers such that a + /3 = k. 

The parity-check matrix H, both in real and complex codes, consist of the n — k columns of 
corresponding to the zero rows of E; thus, HG = 0. 



B. Connection to Frame Theory 

The generator matrix G in © can be viewed as an analysis frame operator. In this view, a real BCH- 
DFT code is a rotation of the well-known harmonic frames |[T3l , |[T4l . and a complex BCH-DFT code 
is basically a harmonic frame. The latter can be understood by removing from © which results in 
a complex BCH-DFT code, on the one hand, and the analysis frame operator of a harmonic frame, on 
the other hand. The former is then evident as is a rotation matrix. Further, it is easy to see that the 
frame operator G H G and Gramian GG H are equal to 



G H G — —Ik, 



GG H = '±W%Y.-Z H W n . 
k 



(12) 
(13) 



The following lemma presents some properties of the frame operator and relevant matrices which are 
crucial for our results in this paper. 

Lemma 2. Let G pX k be a matrix consisting of p arbitrary rows of G defined by ©. Then, the following 
statements hold: 
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i. GG H is a Toeplitz and circulant matrix 

ii. G p xkGp Xk , 1 < p < n is a Toeplitz matrix 

Hi. All principal diagonal entries of G pxk G^ xk , 1 < p < n are equal to 1. 



Proof: Let a r>s be the (r, s) entry of the matrix GG then it can readily be shown that 



, rs =- e jm(e r ~9 s ) + 1 y e jm{9 r -0 3 ) 

r ' s k ^ k ^ 



(14) 




in which 9 X = ^(x — 1). From this equation, it is clear that a Ty 



T.S — 



a r +i jS +i; that is, the elements of 



each diagonal are equal, which means that GG is a Toeplitz matrix. In addition, we can check that 
a r ,n = Or+i,i> i.e., the last entry in each row is equal to the first entry of the next row. This proves that 
the Toeplitz matrix GG H is circulant as well (21]]. Also, a quick look at (fl4l) reveals that the elements of 
the principal diagonal (r = s) are equal to 1. Similarly, one can see that for any 1 < p < n, the square 



Considering Remark [T] one can check that (fT4b is also valid for complex BCH-DFT codes. Note that, 
a and /3 are less constrained for these codes, as mentioned in Remark [TJ 

Remark 2. Lemma |2] also holds for complex BCH-DFT codes. 

Further, in a DFT code, in general, the n — k zero rows of £ are not required to be successive if they 
are not designed for error correction. That is any matrix that can be rearranged as [/& | Okxn-k] T ma y 
represent S. Then, T,T, H is not necessarily in the form given in (TTTj) : it can be any square matrix of size 
n with k nonzero elements equal to 1 , arbitrary located on the main diagonal. Then, again Lemma |2] 
holds because a rs = \ YaZq e jmde ^ ds) and rm € {1, . . . , n}. 

Remark 3. Lemma [2] holds for all DFT codes. 

As a result, all properties that we prove in the remainder of this paper are valid for "any" DFT code, even 
though they are formally proved for real BCH-DFT codes, which we simply refer to as "DFT codes" or 
"DFT frames" hereafter. 



A. Motivation 

In the context of channel coding, there is a special interest in systematic codes |2 ] since the input data 
is embedded in the encoded output which simplifies the encoding and decoding algorithms. For example, 



matrix G pxk G^ xk 



is also a Toeplitz matrix; it is not necessarily circulant, however. 



IV. Systematic DFT Frames 
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in systematic convolutional codes data can be read directly if no errors are made, or in case errors occur 
in erasures and affect only parity bits. 

Systematic codes are also used in parity-based distributed source coding (DSC) techniques, e.g., DSC 
that uses turbo codes for compression ll22ll - |[24l . DSC addresses the problem of compressing correlated 
sources by separate encoding and joint decoding and has found application in sensor networks and video 
compression lTT6l . In DSC, compression is usually realized through the use of binary channel codes. 

Recently, the authors has introduced a new framework that uses real-number codes for DSC. Specifi- 
cally, by using BCH-DFT codes it has been shown that this framework can result a better compression 
compared to the conventional one [15]. Although parity-based and syndrome-based approaches are equal 
in a DSC that uses binary channel codes for compression, the parity-based approach is more worthwhile 
in the DSC that uses DFT codes, as it is more "efficient" than the syndrome-based approach |fl5l . This is 
due to the fact that syndrome is a complex vector, even in a real DFT code, whereas parity is real. The 
parity-based approach requires a systematic generator matrix for DFT codes and is the main motivation 
of this work. 



B. Construction 

A systematic generator matrix for a real BCH-DFT code can be obtained by |fT51 

G sys = GG k 1 , (15) 

in which Gk is a submatrix (subframe) of G including k arbitrary rows of G. Note that Gk is invertible 
since it can be represented as 

G k = ^Wg^Wj, = V k H W k , (16) 

in which V k = \/jW k H xn T> and Wk are invertible as they are Vandermonde and DFT matrices, 
respectively. This indicates that any k rows of a DFT frame make a frame and proves that systematic 
DFT frames exist for any DFT frame. It also suggests that, for each DFT frame, there are many (but, a 
finite number of) systematic frames since the rows of can be arbitrarily chosen from those of G. This 
will be discussed in detail later in Section IVII-CI The codewords generated by these systematic frames 
differ in the "position" of systematic samples (i.e., input data). 

The construction in (|T5T > implies that the parity samples are not restricted to form a consecutive block 
in the associated codewords. Such a degree of freedom is useful in the sense that one can find the most 
suitable systematic frames for specific applications (e.g., the one with the smallest reconstruction error.) 
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C. Optimality Condition 

From rate-distortion theory, it is well known that the rate required to transmit a source, with a given 
distortion, increases as the variance of the source becomes larger [25]. Particularly, for Gaussian sources 
this relation is logarithmic with variance, under the mean-squared error (MSE) distortion measure. In 
DSC that uses real-number codes ifTSl . since coding is performed before quantization, the variance of 
transmitted sequence depends on the behavior of the encoding matrix. In syndrome approach, s = Hx 
|[T5l and it can be checked that a s = a x , that is, the variance is preserved^ However, as we show 
shortly, this is not valid in parity approach and the variance of parity samples depends on the behavior 
of encoding matrix G sys . In view of rate-distortion theory, it makes a lot of sense to keep this variance 
as small as possible. Not surprisingly, we will show that using a tight frame (tight G sys ) for encoding is 
optimal. 

Let x be the message vector, a column vector whose elements are i.i.d. random variables with variance 
o 2 x , and let y = G sys x represent the codeword generated using the systematic frame. The variance of y 
is then given by 

a 2 y = -E{y H y} = -E{x H G? ys G sys x} 
n n 



and 



tr (C^sCsys) — tr \ G k 1H G H GG 



Y-l 



= (18) 

k 

n x ^ 1 

= k^-i V 

i=i 

in which Ai > A2 > • • • > > are the eigenvalues of G^G^ (or VJrV^ equivalently). 

This shows that the variance of codewords, generated by a systematic frame, depends on the submatrix 
Gk which is used to create G sys . Gk, in turn, is fully known once the position of systematic (data) samples 
is fixed in the codeword. In other words, the "position" of systematic samples determines the variance of 

3 In general, any unitary matrix U preserves norms, i.e., for any complex vector x, ||f7a;|| = Note that H is not unitary 
because it is not a square matrix; however, its rows are selected from a unitary matrix and are orthonormal. This lead to 
HH H = I n - k , and tr(H H H) = n-k. 



July 27, 2012 



DRAFT 



10 



codewords generated by a systematic DFT frame. To minimize the effective range of transmitted signal, 
we need to do the following optimization problem, in light of (fTTT ) and (fT8T >. 



k 1 

minimize > — 

A, Xi 
k 

s.t. Ai = fc, Ai > 0, 

1=1 

where, the constraint Aj = /c is achieved in consideration of Lemma [2] and ((6]). 

By using the Lagrangian method, we can show that the optimal eigenvalues are Ai = 1; this implies a 
tight frame ifTUl . In the sequel, we analyze the eigenvalues of G pX kGp Xk , p < n, that helps us characterize 
tight systematic frames, so as to minimize the variance of transmitted codewords. 

V. Main Results on the Extreme Eigenvalues 

In this section we investigate some bounds on the eigenvalues of G pX kG^ xk where G is defined in 
©. These bounds play an important role in the performance evaluation of the systematic DFT frames. 
We also determine the exact values of some eigenvalues in certain cases. 

Theorem 1. Let G pX k, 1 < p < n be any p x k submatrix of G Then, the smallest eigenvalue of 
GpxkGp Xk is no more than one, and the largest eigenvalue of G px j t Gp Xk is at least one. 

Proof: From Lemma |2j we know that all principal diagonal entries of G pX kG^ xk are unity. As a 
result, using the Schur-Horn inequality in ©, we obtain X m in(G pX kGp Xk ) < 1 < A max (G pX fcG^ xA .). 
This proves the claim. ■ 
Note that \i{G pxk G^ xk ) = Xi(G^ xk G pxk ) for any G pxk . Nevertheless, this is not correct for A min 
in general. A tighter bound on Ai can be achieved when G pxk is a talfl matrix. 

Theorem 2. Given a tall (short) G pxk , the largest (smallest) eigenvalue of G^ xk G pxk is lower (upper) 
bounded by p/k. 

Proof: Let p> k. Since all diagonal entries of G pX kG^ xk are unity, from Q we have Yh=i \{G pxk G px 
p. On the other hand, since the nonzero eigenvalues of G pxk G p 1 xk and G^ xk G pX k are equal, G pX kG^ xk 

4 An m x n matrix A is called to be tall if m > n. Similarly, if m < n, then A is a short matrix. 
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has k nonzero eigenvalues and we get 



i=l 
A' 



^2 ^i(GpxkGp 



T pxk) 

< kXi(G pxk G pxk ). 

Thus, for any tall G pxk , Xi(Gp Xk G pxk ) = ^i(G pxk G p 1 xk ) > | > 1. Following a similar line of proof, 
for a short submatrix (p < k) we obtain X m \ n (G I ^ )<k G pxk ) < | < 1. 

Obviously the same bounds are valid for the extreme eigenvalues of G pxk G^ xk . What is more, 
since p/k is the average value of eigenvalues, considering that X m - m (G pxk Gp Xk ) = for p > k, and 
Xmm{Gp Xk G pxk ) = for p < k, from (1201 ) we conclude that corresponding bounds on the largest 
eigenvalues are strict. ■ 

It is worth noting that in d20l the equality is achieved when p = n; it can also be achieved for "specific" 
submatrices only in the case of integer oversampling, i.e., when n = Mk, as we discuss later in this 
paper. 

We use the above results to find better bounds for the extreme eigenvalues of G k G k in the following 
theorem. 

Theorem 3. For any G k , a square submatrix of G in © in which n ^ Mk, the smallest (largest) 
eigenvalue of G k G k is strictly upper (lower) bounded by 1. 

Proof: See Appendix IIX-AI ■ 
Theorem [3] implies that for n ^ Mk we cannot have "tight" systematic frames. Because, for a frame 
with frame operator F H F, the tightest possible frame bounds are, respectively, a = X m - m (F H F) and 
b = A max (F H F) [26]. In other words, for a tight frame X min (F H F) = X max (F H F); i.e., the eigenvalues 
of F H F are equal iflOl . 

Corollary 2. Tight systematic DFT frames can exist only if n = Mk, where M is a positive integer. 

Note that systematic DFT frames are not necessarily tight for n = Mk. In Section I VII I we prove that 
tight systematic DFT frames exist for n = Mk and show how to construct such frames. 

In the remainder of this section, we shall find exact values, rather than bounds, for some of the 
eigenvalues of G k G k when k < n < 2k. This range of n is specifically important in distributed source 
coding. 
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Theorem 4. For any Gk, a square submatrix of G in (J9]), where k < n < 2k, the 2k — n largest 
eigenvalues of ' GkGf? are equal to n/k. 

Proof: From Corollary Q] we know that if two Hermitian matrices sum up to a scaled identity matrix, 
their eigenvalues add up to be fixed. Thus, if A and B have the same spectrum we obtain 

X j (A) + X k _ j+1 (A)= 1 . (21) 

Gh 



Now, let G be partitioned as G 



then A + B = G H G = r U k . Clear 



where p = n — k. Let A = G k Gk and B = G^ xk G pX k, 

Gpxk 

y, Corollary [T] holds with 7 = r- Also, note that when p < k then 
Gp Xk G pX k has only p nonzero eigenvalues. Therefore, in such a case, k — p largest eigenvalues of G k Gk 
are equal to n/k. ■ 
Another interesting case arises when n = 2k. Numerical results shows that under this condition, A and 
B have the same set of eigenvalues. We prove this when Gk either includes successive or every other 
rows of G. In such cases, one can verify that (Gk)ij = e? e (Gk)i,f, thus, Lemma [TJ holds and A and B 
have the same eigenvalues. Hence, from (fJT)) we get 



^j(GkGk) + \k-j+i{Gk Gk) — — — 2. (22) 

This further implies that for odd values of k the middle eigenvalue of G k Gk is 1. 

We close this section with an example illustrating some of the above properties. Consider an (n, k) 
DFT frame and the the following two cases. First, the rows of Gk are evenly spaced rows of G (i.e., 
either odd rows or even rows). This is the "best" submatrix in the sense that it minimizes the MSE. 
For such a submatrix, all eigenvalues are known to be equal, as it is a DFT matrix. For example, for 
n = 10, k = 5, the best square submatrix results in A = 1 with multiplicity of 5. The other extreme 
case, which maximizes the MSE, happens when the rows of Gk are circularly consecutive rows of G. 
Again, for the above example, A = {0.0011,0.1056,1,1.8944,1.9989}. With these examples in mind, 
we will explore the best and worst frames in Section IVlTl We shall now discuss signal reconstruction for 
systematic frames. 

VI. Performance Analysis 

In this section, we analyze the performance of quantized systematic DFT codes using the quantization 
model proposed in [ 10], which assumes that noise components are uncorrelated and each noise component 
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qi has mean and variance a 2 , i.e., for any i,j, 

E{ % }=0, E{q l q j } = a 2 5 ij . (23) 

For one thing, q can be uniformly distributed on [— A/2, A/2], where a 2 = A 2 /12. We assume the 
quantizer range covers the dynamic range of all codewords encoded using the systematic DFT code in 
(ED). 

Let x be the signal (message) to be transmitted. The corresponding codeword is generated by 

y = G sys x. (24) 

This is then quantized to y and transmitted. Assuming the quantization model in d23l . transmitted 
codeword can be modeled by 

y = G sys x + q, (25) 

where q represents quantization error. This also models the received codvector provided that there is no 
error or erasure in channel. Now, suppose we want to estimate x from d25l ). This can be done through 
the use of linear or nonlinear operations. 

A. Linear Reconstruction 

We first consider linear reconstruction of x form y using the pseudoinverse [10] of G sys , which is 
defined by 

k 

Gsys = {G sys G syB ) 1 G Bys = —GkG H . (26) 
The linear reconstruction is hence given by 

x = -G k G H y = x + -G k G H q, (27) 

n n 

where q represents quantization error. 

Let us now evaluate the reconstruction error. The mean-squared reconstruction error, due to the 
quantization noise, using a systematic frame can be written as 

MSE q = iE{||i- a; || 2 } = iE{||Gt ys g|| 2 } 



^{q H G\^G\ ls q} = ia 2 tr (g^G^ 



= ^a 2 q tr(GG%G k G H ) (28) 
= ^y q tr(G«G k G H G) 
= -o-gtr (G k G k ) = -a 2 
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where the last step follows because of Lemma [H This shows that DFT codes reduce quantization error. 
The MSE is inversely proportional to the redundancy of the frame, which is a well-known result fl2l . 

ED- 

The above analysis indicates that the MSE is the same for all systematic DFT frames of the same size, 
no matter they are tight or not. This is however assuming that the effective range codewords generated by 
different G sys is equal, which implies the same a 2 for a given number of quantization levels. However, 
from (fTTl ) it is known that, for a fixed number of quantization levels, depends on the variance of 
transmitted codewords (<jy) if the quantizer is designed to cover the entire effective range of codewords. 
Obviously, though, a 2 can vary from one systematic frame to another, as shown in ( fT8l ). 

Theorem 5. When encoding with a systematic DFT frame in d 1 5b and decoding with linear reconstruction, 
for the noise model (123b and given a same number of quantization levels, the MSE is minimum if and 
only if the systematic frame is tight. 

Proof: All systematic DFT frames amount to a same quantization error provided that the effective 
range of codewords are fully covered, as shown in (f28T >. Nevertheless, for a fixed number of quantization 
levels more codewords are within the range of quantizer if the systematic frame is tight. This is clear from 
(fT8l) and ( fl9l) , recalling that ( fl9l ) is minimized by the tight frames. Moreover, any frame that minimizes 
( fl9l ) is required to be tight. This will be proved in Section IVII-Aj ■ 
The problem we are considering in Theorem [5] is somewhat the dual of that in [12, Theorem 3.1]. 
Note that in lfl2l Theorem 3.1] "uniform" frames are used for encoding which implies the same variance 
for all samples of codewords whereas the reconstruction error is proportional to Yli=i *V On the other 
hand, the frames (G sys ) used in Theorem [5] are not uniform in general; this result in a codeword variance 
proportional to Yl%=i ^« while having a fixed, minimum reconstruction error. 

B. Consistent Reconstruction 

Linear reconstruction is not always the best one can estimate x from y. Although linear reconstruc- 
tion is more tractable, consistent reconstruction is known to give significant improvement over linear 
reconstruction in overcomplete expansions |27l - |[29l . Asymptotically, the MSE is 0(r~ 2 ) for consistent 
reconstruction, where r = n/k is the frame redundancy |28l . As it can be seen from (1281 ), for linear 
reconstruction this is 0(r _1 ). The improvement, in consistent reconstruction, is due to using deterministic 
properties of quantization rather than considering quantization as an independent noise as in ( |23l . 

Although the MSE in consistent reconstruction is approximated by cr~ 2 , where the constant c depends 
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on the source and quantization, this is verified only if the oversampling ratio r is very high [29]. In some 
practical applications of frames, e.g., channel coding, this ratio cannot be high, though. Particularly, in 
the context of our interest, i.e., DSC, r is limited to two 031, Besides, consistent reconstruction methods 
does not provide a guidance on how to design the frame, as they do not point out how to compute the 
constant c. More importantly, d28l ) proves to be predictive of the performance of consistent reconstruction 
ifTOlk therefore, it can be convincingly used as a design criterion regardless of the reconstruction method. 

C. Reconstruction with Error and Erasure 

In the context of channel coding, DFT codes are primarily used to provide robustness against channel 
impairments which can be errors or erasures. Likewise, in DSC these codes play the role of channel 
codes to combat the errors due to the virtual correlation channel |fl5l . Thus, it makes sense to evaluate 
the performance of these codes in the presence of error. To this end, let y = Gx + r\ where r\ = q + e. 
Assuming that the quantization and channel errors are independent, we will have 



where v is the average number of errors in each codeword and E{e T e} = va\. Note that K{e T q} = 
E{q T e} = 0, because q and e are independent and q has mean equal to zero. Finally, following a similar 
analysis as in d28l ), we obtain 



From (l30l it is clear that reconstruction error has two distinct parts, one due to quantization error and 
one due to channel error. It also proves that DFT codes decrease both channel and quantization errors 
by a factor of frame redundancy r = n/k. The above results is for the case when no error correction is 
done. It is worth noting that, even without correcting errors, the MSE can be smaller than quantization 
error. 

As another extreme case, let us consider the case when error localization is perfect, i.e., errors are 
in the erasure form. Then, we remove the corrupted samples and do reconstruction using the error-free 
samples. This approach does not require error correction in order to reconstruct the message; however, 
it is shown to be equal to the coding theoretic approach irTTI . Let yn and r/R denote remaining rows of 
y and rj, respectively. Obviously, tjr includes only quantization error, hence we represent r)R with q^. 



E{t/ 77} = E{q 2 q + q 1 e + e 1 q + e 1 e} 

= na 2 q + vol, 



(29) 




(30) 
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Also, let F denote the rows of C7 sys corresponding to q R . Then, we can write 



y R = Fx + q R 



(31) 



x = F j y R , 



(32) 



where Ft = (F H Fy 1 F H . Thus, similar to <[28]> we will have 



MSE q+p = ±E{\\x - x\\ 2 } = ±-E{\\F^q R \\ 2 } 




(33) 




assume at least k samples are intact which implies ^ > 0. 

One nice property of systematic frames is that reconstruction error cannot be more than quantization 
error as long as systematic samples are intact. This holds even if consecutive samples are erased. We 
know that consecutive erasures can increase the MSE very fast (e.g., see |[TT1 Table I]). This can be 
understood from d33l since F contains I k as a subframe and in the worst case we can use this subframe 
for reconstruction which leads to MSE q+p = a 2 . Adding any other row (sample) will decrease the MSE. 



To show this, let F H = [I k \ E H ). Then, F H F = I k + E H E and, from ©, for i = j, we get m > 1 + £ k 
for i = 1, . . . , k, where is the smallest eigenvalue of E H E. Clearly, > since E H E is a positive 



by adding new rows. 

Finally, with consistent reconstruction, we can further decrease the MSE. To do so, we check if 
reconstructed values x j for systematic samples in (l32l are consistent with their values before reconstruction 
or not, i.e., for any systematic sample, we must have Q{xi) = Q{yRi). Otherwise, we replace X{ with 



VII. Classification of Systematic Frames 

A. The Best and Worst Systematic Frames 

As we discussed in Section |V] the optimal G sys is achieved from the optimization problem ( fT9l ). 
Similarly, to find the worst G sys , we can maximize (TT9l) instead of minimizing it. The optimal eigenvalues 



semidefinite matrix. Further, at least > since otherwise E must be zero. Hence, Yli=i ~ decreases 




(34) 
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are known to be Aj = 1, 1 < i < k. But, how can we find the corresponding G sys , or Gk equivalently? 
More importantly, if a Gk with Xi = 1 does not exist, is there any suggestion for the best matrix? 

We approach this problem by studying another optimization problem. To this end, we first prove the 
following theorem for the eigenvalues of a matrix. 

Theorem 6. Let {Ai, A2, • • • , Xk} be the eigenvalues of a nonsingular k x k matrix A, then we have 

=1 1 i=\ 



where the constant c is a function oftv(A), . . . ,tr(^4 fc 1 ) . 

Proof: See Section HX-Bl 
Now, in view of Theorem Q, we can see that 



argmin — = argmax Aj. (36) 

As a result, the optimal arguments of the optimization problem in (fT9b are equal to those of 

k 



maximize j^J Aj 

k 



(37) 



s.t. Xi = k, Xi > 0, 



in which {Aj}^ =1 are the eigenvalues of G^G^ (or V k H Vk). By using the Lagrangian method, one can 
check that (l37l) has the maximum of 1 and infimum of 0. Then, considering that 



11 Xi = det(V k H V k ) = det(G fc Gf ), (38) 
i=i 

we conclude that the "best" submatrix is the one with the largest determinant (possibly 1) and the "worst" 
submatrix is the one with smallest determinant. 

Next, we evaluate the determinant of V k H Vk so as to find the matrices corresponding to the extreme 
cases. To this end, we first evaluate the determinate of WW H where W is the Vandermonde matrix with 
unit complex entries as defined in (f2]). From Q we can write 

det(WW H ) = \ Y[ \e ie - - e ie «\ 2 

l<p<q<n 



~ n n 11 — n w (39) 

l<p<q<n 

2»(n-l) , tt \n-r 

= sin — r , 

n n 11 V n ) 

r=l 
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in which X = ^-(x— l),r = q—p, and n(n— 1)/2 is the total number of terms that satisfy 1 < p < q < n. 

But, we see that W is a DFT matrix, and thus, its determinant must be 1. Therefore, we have 

n— 1 

/ . 2 7T \«-r 
I Gin — 7' I — 

2"(n-l) ' 



n( S in 2 ^r r =-^-. (40) 

11 V „ on(n-l) v ' 



n 

r=l 

The above analysis helps us evaluate the determinant of V k or G k , defined in ( fT6l ). Let T Tk = 
{i ri ,ir 2 , ■ ■ ■ ,ir k } be those rows of G used to build G k . Also, without loss of generality, assume i ri < 
i r2 < • • • < i Tk . Clearly, i ri > l,i rk < n, and we obtain 

det(V k V h H ) = 1 [J | e ^-e^| 2 

l<p<g<n 

1 vr (41) 

= ^fc II 4sin 2 -(g-p). 

l<p<g<n 

Then, since sin^u = sin^(n — u), one can see that this determinant depends on the circular distance 
between rows in Ir fe - For & matrix with n rows, we define the circular distance between rows p and q 
as min{|<7 —p\,n — \q — p\}. In this sense, for example, the distance between rows 1 and n is one, i.e., 
they are circularly successive. Now, it is reasonable to believe that (|4TI ) is minimized when the selected 
rows are (circularly) successive. Note that sinn is strictly increasing for u G [0, 7r/2], and the circular 
distance cannot be greater than n/2 in this problem. 

In such circumstances where all rows in Z Tk are (circularly) successive, (PiTl ) is minimal and reduces 

to 

ofc(fc — 1) 1 £,_ r 

det(^) = -^n( sin V) • («) 

r=l 

The other extreme case comes up when n = Mk (M is a positive integer) provided that Gk consists of 
every Mth row of G. In such a case, (|4~TT) simplifies to 1, because 



2k(k— 1) / yj- \ k—r 

det(y k V k H ) = -r^ II sin2 ^ Mr ) 



r=l 

2*(k-i) ^ 2 7T ^k-r (43) 



n 



. i , sin — r 

r=l 
= 1, 

where the last step is due to d40b . Recall that this gives the best V k (and equivalently G k ), in light of d37l ). 
For such a G k , it is easy to see that G sys stands for a "tight" systematic frame and minimizes the MSE 
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for a given number of quantization levels. Effectively, such a frame is performing integer oversampling. 
There are M such frames; they all have the same spectrum, though. 

Recall that, from the optimization problem in d36l)-d3~8l) and Theorem[3j det(VfcV^) < 1 for n ^ Mk. 
For such an (n, k) frame, systematic rows cannot be equally spaced in the corresponding systematic frame; 
instead, we may explore a systematic frame in which the circular distance between successive systematic 
samples is as evenly as possible. Then, the circular distance between each successive systematic rows is 
either [n / k\ or \n/k] . More precisely, if I and m, respectively, represent the number of systematic rows 
with circular distance equal to \n/k~\ and [n/k\, they must satisfy 

I + m = k, 

(44) 

5tl+mL|j=n. 

In the following theorem, we prove that the best performance is achieved when the systematic rows are 
as equally spaced as possible, i.e., when (l44l is satisfied. 

Theorem 7. When encoding with an (n, k) systematic DFT frame in (1151 ) and decoding with linear 
reconstruction, for the noise model (1231 ) and given a same number of quantization levels, the MSE is 
minimum when there are I = n — \ n/k\k systematic rows with successive circular distance \n/k~\ while 
the remaining m = k — I systematic rows have a successive circular distance equal to \n/k\. 

Proof: See Appendix IDTCl ■ 
Effectively, the above theorem is generalizing Theorem [5] Note that when n = Mk, [n/k\ = \n/k~\ = 
M and there exist k systematic rows with equal distance; in this case, Theorem [7] reduces to Theorem [5] 
and the corresponding systematic frame is tight. The optimality of this case was proved in (|43T ). When 
n / Mk, we cannot have a systematic frame with equally spaced systematic rows; however, the best 
performance is still achieved when the circular distance between systematic rows is as evenly as possible, 
as explained above. Note that in either case d m i n , the minimum distance between systematic rows, is 
[n/k\ . This is a necessary condition for an optimal systematic frame, as shown in the proof of Theorem[7] 

B. Numerical Examples 

Numerical calculations confirm that "evenly" spaced data samples gives rise to systematic frames with 
the best performance. When a systematic code is performing integer oversampling, we end up with tight 
systematic frames. The first and last codes in Table U are examples of this case. When n ^ Mk, data 
samples cannot be equally spaced; however, as it can be seen from the second code in Table Jl still the 
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TABLE I 

Eigenvalues structure for two systematic DFT frames with different codeword patterns. A "x" and "— " 

RESPECTIVELY REPRESENT DATA (SYSTEMATIC) AND PARITY SAMPLES. 



Code 




("! n H pwnrH 

V7UUC VV KJL l_l 

patern 




A min 


\ 

"max 




U k A 
lli=i Al 




X 


XX 




0.0572 


1.9428 


19 


0.1111 


(6,3) 


X 


X — X 




0.2546 


1.7454 


5.5 


0.4444 


















X 


X X — 




0.2546 


1.7454 


5.5 


0.4444 




X 


— X — X — 




1 


1 


3 


1 




X 


X X X X — 




0.0396 


1.4 


28.70 


0.0827 


(7, 5) 


X 


X X X — X 




0.1506 


1.4 


10.32 


0.2684 


















X 


X — X X — 


X 


0.3110 


1.4 


7.40 


0.4173 




X 


— XXX — 


X 


0.3110 


1.4 


7.40 


0.4173 




X 


X X X X — 




0.0011 


1.9989 


908.21 


4.46 x 10" 4 




X 


XXX — X 




U.UU41 


i.yyoy 


O/IO G/1 


U.UU4/ 




X 


XXX 


X 


0.0110 


1.9890 


96.09 


0.0122 


(10,5) 


X 


XX X 


X 


0.0496 


1.9504 


25.64 


0.0489 


















X 


X — X X — 


X 


0.0512 


1.9488 


23.41 


0.0838 




X 


X — X X — 


— X 


0.1056 


1.8944 


13.79 


0.1436 




X 


— X — X — 


X X 


0.2377 


1.7623 


7.77 


0.4189 




X 


— X — X — 


X — X — 


1 


1 


5 


1 



best performance is achieved when they are as equally spaced as possible. In this table, "x's" and "— 's" 
represent data and parity samples, respectively. Moreover, we observe that, circularly shifted codeword 
patterns behave the same (e.g., in the (7, 5) code, frames with pattern x — x x x — x and x x — x x — x 
have the same performance). Also, reversal of a codeword pattern yields a codeword with the same 

performance (e.g., x x — x is shifted version of reversed x x x— in the (6,3) code). These 

properties hold in general, as stated below. 

Property 1. Circular shift of X Tk , the systematic rows of a systematic frame with analysis frame G sys , 
does not change the spectrum of G^G sys . 
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Property 2. Reversal of I Tk yields a systematic frame with the same spectral properties. 



Proof: From (031 ) we obtain 




n/k 



(45) 



for % = 1, . . . , k. But GkGj: is invariant to the circular shift of rows of G that make X Tk , as long as all 
rows are shifted the same amount in the same direction. This can be seen from the proof of Lemma |2] in 
(fT4l) by defining r' = r + c where r' represent the shifted rows by a constant c and r 6 X rfe . This proves 
Property [TJ Likewise, let r" = n + 1 — r be the reversed row indices. Again, from ([T41) . it is clear that 



These properties together show that the frame operators of systematic frames (Gg ys G sys ), in which the 
"relative" circular distance among the systematic rows are the same, inherit the same spectrum and thus 
show the same performance. 

C. Number of Systematic Frames 

The number of systematic frames is obviously finite but their performance depends on the position of 
systematic rows, or equivalently, the position of data (or parity) samples in the associated codewords, and 
can be the same for different systematic frames. In what follows, we derive an upper and lower bound 
on the number of systematic frames with different spectrum. In other words, we categorize these frames 
based on their performance. To this end, we observe that the problem of finding k x k submatrices of an 
n x k matrix can be viewed as finding different fc-subsets of a set with n elements. This is given by the 
binomial coefficient Qj and is also equivalent to the number of systematic frames. As stated earlier in 
Property [T] circular shift of a codeword pattern does not change its spectrum, and so its performance. We 
define a coset as square submatrices that result in a same performance. Each coset has at least n elements 
(fc-subsets), as shown in Table ITT] To find these elements, it suffices to circularly shift a subset n times. 
Equivalently, for a given /c-subset, we simply add up 1 to each element of a subset. Note that, the subsets 
elements are k row indices of G nX fe and thus cannot be greater than n. Therefore, once a shifted index x 
becomes greater than n, we replace it with ((x)) n where ((x)) n = x — dn if dn + 1 < x < dn + n, d G Z. 
Obviously, each coset has at least n subsets since n — 1 circular shifts of a given subset are distinct; all 
these subsets have the same relative distance, though. This can be seen in Table ITT] Thus, it is clear that 
the number of cosets is the bounded by 



Property |2] holds. 




(46) 
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TABLE II 

Different cosets of (7, 3) DFT frame and their corresponding relative distances and spectrums. The 

COSET LEADERS ARE IN BOLDFACE. 







Ci 




C 2 






c 3 






c 4 




C s 


Leader 


1 


2 3 


1 


2 


4 


1 


2 


5 


1 


3 


5 


13 4 




2 


3 4 


2 


3 


5 


2 


3 


6 


2 


4 


6 


2 4 5 




3 


4 5 


3 


4 


(i 


3 


4 


7 


3 


5 


7 


3 5 6 




4 


5 6 


4 


5 


7 


1 


4 


5 


1 


4 


6 


4 6 7 




5 


6 7 


1 


5 


6 


2 


5 


6 


2 


5 


7 


1 O ( 




1 


6 7 


2 


6 


7 


3 


6 


7 


1 


3 


6 


1 2 6 




1 


2 7 


1 


3 


7 


1 


4 


7 


2 


4 


7 


2 3 7 


Distance 


1 


1 2 


1 


2 


3 


1 


3 


3 


2 


2 


3 


1 3 2 


Weight 




4 




6 






7 






7 




6 


Ai 


2.1558 


1.7539 


1.9066 


1.2673 


1.7539 


A 2 


0.8150 


1.1133 


0.8424 


1.1601 


1.1133 


A 3 


0.0292 


0.1328 


0.2510 


0.5726 


0.1328 



Let X r Tk denote the reversal of X Tk = {i Tl , i r2 , . . . , i rh }. We define 

i; k = ((n + l-Xr h )) n . (47) 

This operation is performed on every element of X Tk . One can see that reversal of a subset does not change 
its distance and spectrum, owing to Property 12 This can reduce the number of cosets. For example, in 
Table HT1 the reversal of {1, 2, 4}, which is the coset leader in C2, is {7, 6, 4} which belongs to C5. This 
indicates C2 and C5 are essentially one coset. The bound in (l46l) is tight if and only if there are u self- 
reversal cosets. Trivial examples of such a code are achieved when k = n — 1 or = 1. A self-reversal 
coset is a coset that the reversal of its elements belong to itself, e.g., Ci, C3, and C4 in Table HTl 

On the other hand, n c > u/2 is a lower bound because there cannot be more than one reversal for a 
given coset. It can be further seen that the coset with smallest weight (Ci) is always self-reverse, i.e., 
the reversal of each element of Ci is its own element for any (n, k) frame. This implies that the lower 
bound is not achievable. Therefore, 



July 27, 2012 



DRAFT 



23 



VIII. Conclusions 

We have introduced, proposed the construction method, and analyzed the performance of systematic 
DFT frames in this paper. Several systematic DFT frames can be made out of the generator matrix of a 
BCH-DFT code. The performance of these frames, however, depends on the relative position of systematic 
samples or, equivalently, parity samples in the codeword. We proved that evenly spaced systematic (parity) 
samples result in the minimum mean-squared reconstruction error, whereas the worst performance is 
expected when systematic samples are circularly consecutive. Further, we found the conditions for which 
a systematic DFT frame can be tight, too. Tight systematic DFT frame can be realized only if the frame 
is performing integer oversampling and systematic samples are circularly equally spaced. Finally, for 
each DFT frame, we classified systematic DFT frame based on their performance. 

It would be interesting to extend this work to oversampled DFT filter banks, an infinite-dimension of 
DFT frames. Oversampled filter banks can be used as error correcting codes ll30l . In a relevant work, 
systematic wavelet subcodes, a form of real-number convolutional codes, has been proposed for data 
protection lETTl . 
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IX. Appendix 

A. Proof of Theorem \3\ 

Proof: Let n = Mk+l, < I < k, then G can be partitioned as G = [Gf | G\ H \ ■ ■ ■ \ g[ M ~ 1)H | Gf x f } H . 
In general, Gk, G\, . . . , G^ 1 ^ 1 and Gjf xl include arbitrary rows of G, hence they have different spectrums, 
i.e., different sets of eigenvalues. Suppose, for the purpose of contradiction, that \k(G^Gk) = 1; this 
can occur only if Gk consist of the rows of G such that the distance between each two successive rows 
is at least MO Such an arrangement guarantees the existence of G\, . . . , Gjr -1 so that G k nH G k n , for any 
1 < m < M — 1, has the same spectrum as G^Gk- To find the row indices corresponding to C7£\ we can 
simply add m to each row index of G k . Then, to show these matrices have the same spectrum, we use 
Lemma [B Given a G k , one can verify that {Gf)^ = e j2 ^(G k ) itj and thus (Gf)^ = e~ j ^ (G fc )g. 

5 \k{Gfr Gk) = 1 is the optimal solution for j 1 9b and necessitate d m i n = M, as discussed in Theorem[7] 
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Therefore, G k nH G k n and G^G k have the same spectrum for any 1 < m < M — 1. Next, we see that 
G H G = A + B in which A = G% G k + - ■ ■ + G^ t ~ 1)H G^f _1 and B = G^G^ xl . Then, in consideration 
of the above discussion, Xi(A) = MXi(G^G k ) for any 1 < i < k. Hence, from ([8]), for i = 1, j = k, 
we will have 

X k (A) + X 1 (B) < X 1 (A + B) 
MX k (G^G k ) <-£ — Ai(-B) 
* A fc (Gf G k ) < A_ = iL_ < i, 



fc -v-/ (49) 

n -| n 1 

^ = 7|] 

where the last line follows using X\(B) > 1 from Theorem [TJ But this is contradicting our assumption 
X k (G^G k ) = 1, and thus completes the proof that, for n ^ Mk, the largest possible X k (G^G k ) is 
strictly less than 1, for any G k U 

The proof of the other bound (Xi(G^G k ) > 1) is then immediate because X]i=i ^i(G k G k ) = 



B. Proof of Theorem [6] 

Proof: Let A be a x k matrix. The eigenvalues of A are the roots of the characteristic polynomial 



defined as 



It can be shown that 



P(A) = jj(A - Xi). (50) 



A- 



Next, by setting A = we get 



m = y^ (si) 

P(A) ^A-A/ PiJ 

(g^)-(n^) = ("l) fc+1 ^(0) = (52) 

Clearly P'(0) is a constant for a given matrix A; it is a function of eigenvalues of A. To find this constant, 
we use Newton's identities to re-express the characteristic polynomial in (1501 as 

P(A) = Po X k + piA^ 1 + . . . + p^X + Pk , (53) 

6 Note that when n = Mk, B is an empty matrix and we must put Xi(B) = in l |49t which result in \k(Gf? Gk) < 1 and 
does not guarantee a bound strictly less than 1. 
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in which po = 1 and, for j = 1, . . . , k, 

3-1 



1 

Pj = —-^ZviSj-h 



1=0 



(54) 



A' 



S J = E A '' =M^'). (55) 



Thus, the constant c in (l52l) is given by c = (— l) fc+1 PA:-i which is a function of tx(A), . . . ,tv(A 



C. Proof of Theorem [7| 

Proof: Consider an (n, k) DFT frame, let M = [n/k\, and assume that all rows in Z Tk , except the 
first and last rows, are equally spaced with distance M (without loss of generality, we assume i ri = 1, 
then i rj = (j — 1)M + 1, j < k). Hence d m i n = M, where the minimum distance d m i n is defined as the 
smallest circular distance among the selected rows. In such a setting, from (HTb and similar to d43l) , we 
can write 

nA;(A;— 1) 1 

det(^) = n (sin 2 -Mr) . (56) 

r=l 

We prove that, in view of (I37T ). the systematic frame corresponding to the above arrangement has better 
performance than any other arrangement in which the minimum distance among the systematic rows is 
less than M. To this end, we first assume that all selected rows in X rk remain the same except one row 
which is shifted one unit in a way that d m i n decreases. For example, without loss of generality, consider 
T' Tk for which i' ri = 2, i' r . = i r ., 1 < j < k; hence d m i n = M — 1. Then, from (|4TT >. we obtain 

det(V k V k H )\ x , k n£iWs(Mr-l) 



det(V k V k H )\ Irk ~ H k r Zl S in 2 ^Mr 



< 1. (57) 



To prove the inequality, equivalently, we show that 



. (M-1W - (2M-1W - ((fe-l)M-l)7r 
sin i — sin — ■ ■ ■ sin M — 

sm T sm- sin 

We break up this inequality into \Jk/2\ inequalities, each of which strictly less than one. First, consider 

the first and last terms in the numerator and denominator. We can write 

. (JU-1W . ((fc-lUf-lW (k-2)MTT (fcM-2W 
sm — sm — — cos i cos — 



sin Mil sin (hzlMjL cos (±z3^M2L _ cos Mdi 

n n n n 



< 1, (59) 
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where the inequality follows since cos > cos Mpt, as < tt. Likewise, for the second and 

penultimate terms we have 

. (2M-1W . ((fc-2)M-l)7r (k-4)Mn (kM-2)iT 

sin ^ — sm ^ '- — cos ^ ' cos i — 



■ 2Mtv ■ (k-2)Mir (fc-4)A/7r kM-K 

sm ^zr- sm ^ — -f cos - — -f cos ^^^zr- 

< 1. (60) 

A similar reasoning can be used for other terms that are equally spaced from the two ends. 

Clearly, the same argument is valid when 2 < i' < M and the other rows are the same, i.e., 
i' r . = < j < k and d m i n = M — i' Tl . Moreover, when more than one row index is changed, in a 
way that two or more selected rows have a distance less than M, the above argument is valid and we 
can show that new determinant is even less than the case with one changed index. In fact, in such a case, 
it is easier to compare the new one with its parent; i.e., to compare the case with two changes with the 
case with one change. As a result, we can see that any combination of rows with d m i n < M performs 
worse than the case with d m i n = M, on account of (37); that is, d m i n = M is necessary condition for 
optimality. In other words, that optimal systematic frame must satisfy d m i n = M. 

Next, we show that among systematic frames with d m i n = M the one that satisfies (1441) is the best. That 
is, the optimal systematic frame has I = n — [n/k\k systematic rows with successive circular distance of 
\n/k~\ and m = k — I systematic rows with successive circular distance of [n/k\. To prove this, again 
we compare det(V^V^) in (|4TT) for this case and the other cases with d m i n = M. The arguments are 
very similar to what we used above. Before moving on, we should mention that for I G {0,1, fc- 1} the 
proof in the first part is sufficient. 

Let Z° k denote the set of rows satisfying the constraints in (l44l) ; obviously, d m i n = M. We claim that 
any other selection of systematic rows, for which d m i n is M, results in a smaller det(VfcV^); that is, 
det(VfcV^)|z rfc < det(VfcV^)|2° . Let us evaluate the case where only the row index for one of those I 
rows varies, provided that d m i n = M is keptQ We then have 

det(V k V k H )\ Xrk rfc 1 sin 2 £Mr 



det(V k V k H )\ I?k n*=i sin2 K Mr + 1) 

Again it suffice to prove that 



< 1. (61) 



sm ^-^ sm ^^-^ • • • sm * '- 

n n n 

- (Af+lW - (2M+1W ■ ((fc-l)M+lW 
sm i '— sm i '— ■ ■ ■ sm ' '— 



< 1, (62) 



7 Note that, with this shift of row, we are looking for an arrangement of a systematic frame that does not satisfy j44b : 
otherwise, det(VfcV fc H ) will not vary, as the frame properties has not changed essentially. More specifically, a new, different 
arrangement will introduce a new distance equal to \n/k] +1. 
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and this can be done by the same divide and conquer approach, used in the first part of this proof. For 
instance, for the first and last terms in the numerator and denominator we have 



gin Mjt gin MM, CQS (k-^Mn _ kM^ 

n n n n 



sin (M+DE sin W°-i)M+i)« (k-m. _ 

n n n n 

< 1, (63) 

where the inequality follows for cos ^ kM + 2 ^ 7T < CO s hMjL_ Finally, the other cases, where two or more rows 
change, can be proved comparing their determinant with their ancestor's with a similar reasoning. This 
completes the proof that a systematic frame with the most evenly spaced systematic rows, or equivalently 
data samples in the corresponding codewords, is the best in the minimum MSE sense. 
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